refactor: migrate test infrastructure from shell scripts to Rust-based skill-bench #47

Merged
sonesuke merged 2 commits into main from
refactor/skill-bench-test-infrastructure
Mar 30, 2026

@sonesuke
Owner

Summary

  • Remove shell script-based test infrastructure (agents/skill-bench/)
  • Add TOML-based test cases in tests/ directory
  • Convert test checks from shell scripts to skill-bench format
  • Update mise.toml with test task for skill-bench
  • Add logs/ directory with .gitkeep for test execution logs
  • Update .gitignore to exclude log files but keep .gitkeep

Test plan

  • Run mise run test to verify skill-bench tests execute correctly
  • Verify all test cases load without parsing errors
  • Check that log files are created in logs/ directory

🤖 Generated with Claude Code

web-flow and others added 2 commits March 30, 2026 02:31
…d skill-bench

- Remove shell script-based test infrastructure (agents/skill-bench/)
- Add TOML-based test cases in tests/ directory
- Convert test checks from shell scripts to skill-bench format
- Update mise.toml with test task for skill-bench
- Add logs/ directory with .gitkeep for test execution logs
- Update .gitignore to exclude log files but keep .gitkeep

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Update ANTHROPIC_DEFAULT_OPUS_MODEL to glm-5.1
- Update ANTHROPIC_DEFAULT_SONNET_MODEL to glm-5-turbo

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@sonesuke sonesuke merged commit 2cacff6 into main Mar 30, 2026
3 checks passed
@sonesuke sonesuke deleted the refactor/skill-bench-test-infrastructure branch March 30, 2026 02:44